Enhanced Server Fault Tolerance for Improved User Experience; CU-CS-1037-08
Authors
Abstract
Interactive applications such as email, calendar, and maps are migrating from local desktop machines to data centers because of the many advantages such a computing environment offers. Furthermore, this trend is creating a marked increase in the deployment of servers at data centers. To ride the price/performance curves for CPU, memory, and other hardware, inexpensive commodity machines are the most cost-effective choice for a data center. However, because these machines have low availability, the probability of server failure is relatively high. Server failures in turn cause service outages, degrade the user experience, and eventually result in lost revenue for businesses. This paper proposes a TCP-splice-based web server architecture that seamlessly tolerates both web proxy and backend server failures. The client's TCP connection and sessions are preserved, and failover to alternate servers after a server failure is fast and transparent to the client. The architecture supports both deterministic and nondeterministic server applications. A prototype of this architecture has been implemented in Linux, and the paper presents detailed performance results for a PHP-based webmail application deployed on it.
Similar resources
Enhanced Server Fault-tolerance Techniques for Seamless User Experience
User applications, such as email, calendar, maps, are migrating from local desktop machines to data centers due to the many advantages offered by such a computing paradigm. Furthermore, this trend is creating a marked increase in the deployment of servers at data centers. To ride the price/performance curves for CPU, memory and other HW, inexpensive commodity machines although having low availa...
Stability Assessment Metamorphic Approach (SAMA) for Effective Scheduling based on Fault Tolerance in Computational Grid
Grid computing allows coordinated and controlled resource sharing and problem solving in multi-institutional, dynamic virtual organizations. Moreover, fault tolerance and task scheduling are important issues for large-scale computational grids because of the unreliable nature of grid resources. A commonly exploited technique to realize fault tolerance is periodic checkpointing that periodically ...
CS 230 Distributed Systems Winter '08 Term Project Fault Tolerance Mechanisms in Distributed Systems
Improving the palbimm scheduling algorithm for fault tolerance in cloud computing
Cloud computing is the latest technology that involves distributed computation over the Internet. It meets the needs of users through sharing resources and using virtual technology. The workflow user applications refer to a set of tasks to be processed within the cloud environment. Scheduling algorithms have a lot to do with the efficiency of cloud computing environments through selection of su...
Mobile Thin-Client System with Fault Tolerance and Scalability by "HTTP-FUSE-KNOPPIX-BOX"
We have developed a mobile thin-client server named "HTTP-FUSE-KNOPPIX-BOX" that incorporates mechanisms for increasing performance, distributing load, and providing fault tolerance to achieve a practical HTTP-FUSE-KNOPPIX-based thin-client environment for mobile systems and other environments with limited hardware resources such as CPU and memory. Software RAID raises file input/output perform...